Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI

PRIMAL: Processing-In-Memory Based Low-Rank Adaptation for LLM Inference Accelerator
arxiv.org·15h
📊Quantization
No Libraries No Shortcuts: Reasoning LLMs from Scratch with PyTorch — Part 2
pub.towardsai.net·1d
📏Linear Logic
From 75% to 99.6%: The Math of LLM Ensembles
shibaprasadb.com·12h·
Discuss: Hacker News
🧮Kolmogorov Bounds
Privacy-Preserving Active Learning for heritage language revitalization programs with zero-trust governance guarantees
dev.to·10h·
Discuss: DEV
🔒Privacy Archives
The three types of LLM workloads and how to serve them
modal.com·4h·
Discuss: Hacker News
⚙️Batch Processing
LLMOrbit: A Circular Taxonomy of Large Language Models -From Scaling Walls to Agentic AI Systems
arxiv.org·15h
🌀Brotli Internals
IPAB Workshop - 22/1/26 | IPAB
informatics.ed.ac.uk·10h
🎼Audio Lambda Calculus
Quiz: How to Integrate Local LLMs With Ollama and Python
realpython.com·8h
⚙️WASM Runtime
Using Local LLMs to Discover High-Performance Algorithms
towardsdatascience.com·2d
🧮SMT Solvers
Everything Moe
ianbarber.blog·16h·
Discuss: Hacker News
🧠Learned Compression
A Visual Guide to Quantization
newsletter.maartengrootendorst.com·2d
📊Quantization
As Strong As Your Weakest Parameter: An AI Authorization Bypass
praetorian.com·4h
🎯Threat Hunting
Can We Build an NX Bit for LLMs
bogdandeac.com·1d·
Discuss: Hacker News
🖥️Modern Terminals
Redacting Faces, People, Vehicles, and Plates with Amped Replay Assisted Redaction
blog.ampedsoftware.com·5h
🧪Archive Fuzzing
Evolution of LLMs use by a programmer
asfaload.com·3h·
Discuss: Hacker News
🧩WASM Components
The coming industrialisation of exploit generation with LLMs
dev.to·1d·
Discuss: DEV
⚔️Lean Tactics
Ensemble Listening Model (ELM): State-of-the Art Foundation Model Accuracy. A Fraction of the Cost.
ensemblelisteningmodel.com·1d·
Discuss: Hacker News
🎵Audio ML
AI researchers map models to banish 'demon' persona
theregister.com·23h
🔍Vector Forensics
Norm-Preserving Biprojected Abliteration
huggingface.co·2d·
Discuss: Hacker News
📊Quantization
How Static Analysis Can Expose Personal Data Hidden in Source Code
hackernoon.com·8h
📊Static Analysis
